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CP-odd scalars are an integral part of many extensions of the Standard Model. Recently, 
electroweak-scale pseudoscalars have received increased attention in explaining the diffuse gamma- 
ray excess from the Galactic Centre. Elusive due to absence of direct couplings to gauge bosons, 
these particles receive only weak constraints from direct searches at LEP or searches performed 
during the first LHC runs. We investigate the LHC’s sensitivity in observing a CP-odd scalar in 
di-top associated production in the mass range 20 < < 100 GeV using jet substructure based re¬ 

construction techniques. We parametrise the scalar’s interactions using a simplified model approach 
and relate the obtained upper limits on couplings within type-I and type-H 2HDMs as well as the 
NMSSM. We find that in di-top-associated production, experiments at the LHC can set tight limits 
on CP-odd scalars that fit the Galactic Centre excess. However, direct sensitivity to light CP-odd 
scalars from the NMSSM proves to remain challenging. 


I. INTRODUCTION 

The recent discovery of the Higgs boson mm marked a new era for fundamental physics. For the first time an 
electroweak-scale scalar resonance has been discovered, supposedly a remnant of the mechanism underlying electroweak 
symmetry breaking [3]. 

While elementary scalar particles have been observed in nature for the first time, they are often an integral part of 
Standard Model (SM) extensions, e.g. Supersymmetry or general N-Higgs Doublet Models. When these extensions 
contain complex scalar fields, as a result, CP-odd scalars are introduced to the spectrum of the theory. Hence, since 
their existence would be evidence for physics beyond the SM, searches for CP-odd scalars are at the core of the current 
LHC program. 

Recently, CP-odd scalars as mediators between Dark Matter (DM) and SM particles have received attention in 
explaining the diffuse gamma-ray excess from the Galactic Centre [Mi! in the contexts of the so-called Coy Dark 
Matter models |5HTU] , and the Next-to-Minimal-Supersymmetric-Standard-Model (NMSSM) |TT1 [T3] . Hence, they 
are included as mediators in simplified models by the ATLAS and CMS collaborations to recast searches for jets and 
missing transverse energy (‘monojet’) during the upcoming LHC runs |13[ I14j . 

In contrast to the widely accepted paradigm that new physics particles have to be heavy, i.e. masses of 0(1) TeV or 
beyond, the mass of CP-odd scalars is almost unconstrained by direct searches. As interactions between gauge bosons 
and CP-odd scalars are only induced via higher-dimensional operators, e.g. limits from LEP are fairly 

weak. The main collider sensitivities may mainly arise from bottom quark or top quark-associated productions (for 
recent explorations, see [mds]). Further, due to the predicted velocity suppression in direct detection experiments 
for CP-odd scalar mediators, even light CP-odd scalars are still in agreement with experimental observations. Some 
constraints from flavour physics exist but limits are again weak if rriA ^ 5 GeV assuming the CP-odd scalar 
interacts with fermions in agreement with the hypothesis of minimal flavour violation |18] . 

Therefore, indirect detection experiments and direct searches at the LHC appear to be the most sensitive ways to 
search for the existence of electroweak-scale CP-odd scalar particles. In this paper, instead of previously explored 
paths of searching for CP-odd scalars in gluon-fusion production imiig, we focus on the direct production of such 
particles in association with a top quark pair and subsequent decay into a bottom quark pair, pp —^ tiA —>■ tibb. Thus, 
we derive limits on the mass and coupling strength of the CP-odd scalar in a process with unsuppressed fermion 
couplings only.^ 

This paper is organised as follows. In Sec. |ll| we briefly outline the way we incorporate the CP-odd scalar into 
the theory, using a simplified model approach. The event generation and details of the final state reconstruction are 


^ We note that, if the pseudoscalar couples for example in a universal way to fermions as part of a UV-complete model, thereby not 
respecting Yukawa-like coupling hierarchies, other production and decay channels might be more sensitive. However, the analysis we 
provide is still valid as a subset of possible search channels. 
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described in detail in Secs. Ill and |IV| In Sec. [V] we derive limits on the mass of the CP-odd scalar and its couplings 
to top quarks. Such limits can be applied to models where the CP-odd scalar arises as part of a Higgs multiplet. We 
recast these limits in the context of the 2HDM and the NMSSM in Sec|VI| Finally, in Sec|VII| we offer conclusions. 


II. SIMPLIFIED MODEL 


While CP-odd scalars are present in many extensions of the SM, for simplicity and generality of our results, we 
use a simplified model approach [21] to parametrise the contribution of this particle in the process pp —>■ tiA —> tibb. 
More precisely, we add couplings of the CP-odd scalar with the bottom and top quarks to the full SM Lagrangian 


= -^SM + -^CP- 


odd 7 


( 1 ) 


where 


^CP-odd 


■ ffm T , < 


■dbVb T , < 


( 2 ) 


and Pi (i = t, b) parametrises the deviation from the SM Yukawa coupling pi = mijv. 

Recently, a similar approach was proposed to recast monojet searches at the LHC in terms of scalar mediators 
between the SM and a secluded sector [22lI25j . In a similar way, we will focus on the minimal set of free parameters 
relevant to the process considered. Throughout this paper we will assume H to be a narrow resonance with 2mt, < 
rriA < 2mt. Hence, in our approach the CP-odd scalar decays exclusively into bottom quarks with B{A ^ bh) = 1 
and its width Pa is completely determined by the value of pb- For a narrow resonance, the kinematic distributions 
are expected to remain largely independent of the value of Fyi. 


III. EVENT GENERATION AND SIMULATION DETAILS 


A. Signal and background modelling 


Signal and background samples corresponding to pp collisions at i/s = 14 TeV are generated using the MadgraphS 
2.1.1 [26] leading-order (LO) generator and the CTEQ6L1 [27| set of parton distribution functions (PDF), interfaced 
to Pythia v6.427 [55j for parton showering and fragmentation and using the Perugia2011C [531 underlying event 
tune. In all cases, a top quark mass of 172 GeV is assumed and top quarks are decayed inclusively by Pythia. 

Samples of tiA signal events are generated for different values of the A boson mass, ttia = 20,30,40,60,80 and 
100 GeV, and assuming pt = 1 and B{A bb) = 1. A model corresponding to the Lagrangian shown in Eq. [^is 
implemented using Feynrules 2.1 |3D| and imported as UFO model |3T] in MadgraphS. The LO signal cross section 
predicted by MadgraphS (see Table|l]) is scaled by a k-factor of 1.3. This k-factor is obtained as the ratio of the NLO 
to LO cross sections for tih production, where h is a CP-even Higgs boson. It has been checked that this k-factor 
is rather constant as a function of m/j, varied between 20 and 125 GeV. Figure [^a) compares the production cross 
section between tih and tiA as a function of the Higgs boson mass, in both cases assuming pt=i- The ratio between 
both cross sections varies significantly versus mass, with the tih cross section being about a factor of 20 larger than 
the tiA cross section at a mass of 20 GeV, and only about a factor of two larger at a mass of 120 GeV [32]. This 
difference results from the presence of the extra 75 factor in the interaction between a CP-odd Higgs boson and the 
top quark, compared to the case of a CP-even Higgs boson. Another consequence of the different interaction is that a 
CP-odd Higgs boson has a substantially harder pT spectrum compared to the CP-even case, particularly at low mass, 
as illustrated in Fig. Sb). This is a key feature exploited in this analysis, as discussed in Sec. jlV] 


tua (GeV) 

20 

30 

40 

60 

80 

100 

cr’"°(ttA) (pb) 

0.46 

0.42 

0.39 

0.32 

0.27 

0.23 


TABLE I: Leading-order cross section for ttA production in pp collisions at y/s = 14 TeV as a function of the A boson mass 
niA- As discussed in the text, this LO cross section is obtained assuming gt = 1 and will be multiplied by a k-factor of 1.3 to 
approximate the NLO cross section. 


A large sample of tt-|-jets background events is generated with up to two additional partons in the 5F scheme (i.e. 
including b- and c-quarks). To avoid double-counting of partonic configurations generated by both the matrix-element 
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(a) (b) 


FIG. 1: (a) Comparison of the leading-order cross section for tth (solid line) and ttA (dashed line) in pp collisions at ^/s = 14 
TeV as a function of Higgs boson mass. In both cases a value of = 1 is assumed, (b) Comparison of the Higgs boson pt 
between tth (red) and tiA (black) for two different values of the Higgs boson mass, 20 GeV (solid) and 100 GeV (dashed). 


calculation and the parton shower, a parton-jet matching scheme (“MLM matching”) is employed. The sample 
is normalised to a cross section of 990 pb obtained using TopH — h v2.0 [34] at next-to-next-to-leading order (NNLO) 
in QCD, including resummation of next-to-next-to-leading logarithmic (NNLL) soft gluon terms [55H55] . and using 
the MSTW 2008 NNLO HOJ HI] PDF set. The tt+jets sample is generated inclusively, but events are categorised 
depending on the flavour content of additional particle jets in the event (i.e. jets not originating from the decay of the 
tt system). Particle jets are reconstructed with the anti-fct [42ti44] algorithm with a radius parameter R — 0.4 and are 
required to have px > 15 GeV and \r]\ < 2.5. Events where at least one such particle jet is matched within AR < 0.4 
to a 6-hadron with px > 5 GeV not originating from a top quark decay are generically labelled as tt-|->16 events. 
Similarly, events where at least one such particle jet is matched within AR < 0.4 to a c-hadron with px > 5 GeV 
not originating from a W boson decay, and that are not labelled already as ti+>lb, are labelled as tt+>lc events. 
Events labelled as either tt+>lb or tt-|->lc are generically referred to below as tt-|-HF events, where HF stands for 
“heavy flavour”. We do not apply dedicated corrections to the normalisation of tt-|-HF events, since Run 1 searches at 
the LHC |45| showed that the LO prediction from MadgraphS using the same settings as us is consistent with data 
within ^ 20%, and a larger systematic uncertainty will be assumed in this study. As in Ref. [45], a finer categorisation 
of tt-l-HF events is considered for the purpose of assigning systematic uncertainties associated with the modelling 
of heavy-flavour production in different topologies. In this way, a distinction is made between events with only one 
extra heavy-flavour jet satisfying the above cuts (referred to as ti+b or tt+c), events with two extra heavy-flavour jets 
(referred to as tt+bb or tt+cc), and events with one extra heavy-flavour jet containing two 6- or c-hadrons (referred to 
as tt+B or tt+C). The remaining events are labelled as tt-flight-jet events. Including those with no additional jets. 

Additional background samples corresponding to tiW, tiZ and tthsm production, where /ism is the SM Higgs boson, 
are also produced. The tiW sample is generated requiring at least one W boson in the event to decay leptonically, 
and is normalised to the corresponding LO cross section, 0.404 pb, times a k-factor of 1.4 [15]. The tiZ sample 
is generated requiring Z ^ qq decays and is normalised to the corresponding LO cross section, 0.353 pb, times a 
k-factor of 1.3 [15]. Finally, the tthsM sample is generated assuming ruh = 125 GeV and requiring h ^ bb decays. 
It is normalised to the NLO cross section [l7Hl9] . 0.611 pb,_times the /ism —t bb branching ratio of 57.7% [50U53] . 
collected In Ref. [51] . In these samples Z ^ qq and /ism —t bb decays are performed by MadgraphS and top quarks 
and W bosons are decayed by Pythia. 


B. Event reconstruction 

The generated samples at the particle level are processed through a simplified simulation of the detector response 
and object reconstruction. 
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Isolated leptons (electrons or muons) are required to originate from a W boson or r-lepton decay and to have 
Pt > 25 GeV and It^I < 2.5. Furthermore, they are required to not overlap with jets, as discussed below. A typical 
per-lepton identification efficiency of 80% is assumed. 

Stable particles from Pythia, except for muons and neutrinos, are processed through a simplified simulation of a 
calorimeter. The four momenta of particles falling within the same window in r]-(f> space of size A 77 x At/) = 0.1 x 0.1 
are added together to simulate the finite granularity of calorimeter cells. For each cell, the total three momentum is 
rescaled such as to make the cell massless. Cells with energy larger than 0.1 GeV and |? 7 | < 5.0 become the inputs to 
the jet algorithm. Several types of jets are considered in this analysis. 

The anti-fct algorithm is used to reconstruct jets with two different radius parameters, R = 0.2 and R = 0.4, 
referred to as AKT2 and AKT4 jets respectively. The minimum jet pt threshold for reconstruction is 5 GeV. During 
jet reconstruction, no distinction is made between identified electrons and jet energy deposits, and so every electron 
is also reconstructed a jet. In order to remove this double counting, if any of the jets in the AKT2 and AKT4 
collections lie within AR = 0.2 of a selected electron, the closest jet from each jet collection is discarded. Since 
this analysis has a large number of 6 -quark initiated jets, for which a significant fraction of energy is carried away 
by muons in semi-muonic 6 -hadron decays, the four momenta of all reconstructed muons with px > 4 GeV that are 
ghost-associated |55l 156) to a jet are added to the calorimeter jet four momentum. After this correction, a minimum 
Pt requirement of 15 GeV and 25 GeV is made for AKT2 and AKT4 jets respectively. All jets are required to satisfy 
\ri\ < 2.5. Finally, any electron or muon within AR = 0.4 of a selected AKT4 jet is discarded. In this analysis AKT4 
jets are used to define the minimum jet multiplicity required in the event selection, while AKT2 jets are used to define 
the 6 -tag multiplicity of the event. The latter is particularly important since at low mA values the 6 -quarks from the 
A ^ bb decay emerge with small angular separation. The flavour of an AKT2 jet is determined by matching it within 
AR = 0.15 with a 6 -hadron or a c-hadron (not originating from a 6 -hadron decay), resulting in the jet being labelled 
as 6 -jet or c-jet respectively. The rest of the jets are taken to originate from the fragmentation of a light quark or 
gluon and are labelled as “light jets”. Heavy-flavour tagging is modelled in a probabilistic fashion by assigning a 
per-jet efficiency of 70% to 6 -jets, 20% to c-jets, and 0.7% to light jets. 

In addition, jets are reconstructed with the Cambridge-Aachen (C/A) algorithm in order to reconstruct 

the A —>■ 66 decay, taking advantage of the boost with which A bosons are produced in the ttA process. Two radius 
parameters are considered for C/A jets, Rp^^ = 0.6 and 0.8, referred to as CA 6 and CA 8 jets respectively. The choice 
of radius for C/A jets is optimised in order to optimally reconstruct the ttA signal depending on the value of mA- 
In order to minimise the impact of soft radiation and pileup (not modelled in this analysis), the mass-drop (a.k.a. 
BDRS) filtering algorithm (^[SD] with the following parameters, //ftac = 0.67 and j/cut = 0.09 [ST], is applied to the 
reconstructed C/A jets. A semi-muonic energy correction to the C/A jet four momentum is also applied, as in the 
case of AKT2 and AKT4 jets. 


IV. EXPERIMENTAL ANALYSIS 
A. Analysis strategy and event selection 

This search is focused on the tiA W^bW~hbb process, with one of the W bosons decaying leptonically and 
the other W boson decaying hadronically. Only electrons or muons originating from W boson or r-lepton decays 
are considered. The resulting final state signature is thus characterised by one electron or muon, and high jet and 
6 -jet multiplicity that can be exploited to suppress the background, dominated by tt+jets production. Therefore, the 
following preselection requirements are made: one electron or muon, >5 AKT4 jets and >3 AKT2 6-tagged jets, in the 
following simply referred to as >5 jets and >3 6-tags. In order to optimise the sensitivity of the search, the selected 
events are categorised into two separate channels depending on the number of 6-tags (3 and >4). The channel with 
>5 jets and > 4 6-tags has the largest signal-to-background ratio and therefore drives the sensitivity of the search. 
It is dominated by tt-THF background. The channel with 3 6-tags has significantly lower signal-to-background ratio 
and the background is enriched in tt-|-light-jets. The simultaneous analysis of both channels is particularly useful 
to calibrate in-situ the tt-|-jets background prediction (including its heavy-flavour content) and constrain the related 
systematic uncertainties, as it will be discussed in Sec. |IV C[ This is a common strategy used in many experimental 
searches in the ATLAS and CMS collaborations [IHl 1621163) . which we mimic here in order to obtain more realistic 
projected sensitivities. 

An extra handle is provided by the significant boost of the A boson in a fraction of signal events, which results 
in the two 6-jets from the A ^ bb decay emerging with small angular separation between them. This is particularly 
relevant for low mA values, as shown in Fig. As a result, the A boson decay products can be reconstructed into a 
single fat jet, whose mass distribution would show a resonant structure peaked at the correct mA value. This feature 
is also very powerful to discriminate against the background. Therefore, a further requirement is made to have at 
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(a) (b) 

FIG. 2: Distribution of AR between the two 6-quarks from the A ^ bb decay prior to any selection requirements, for different 
values of m^: (a) rriA = 20, 30 and 40 GeV, and (b) tiia = 60,80 and 100 GeV. 


least one C/A BDRS-flltered jet with radius parameter Rp^ and minimum px depending on the rriA hypothesis being 
tested. In order to correctly reconstruct a significant fraction of the signal while rejecting as much background as 
possible, CA6 jets are used for ruA < 40 GeV, while CAS jets are used for higher toa values (up to 100 GeV). The 
minimum px requirements on the C/A jets are 60, 100, 120, 150, 200 and 250 GeV for tua = 20,30,40,60,80 and 
100 GeV, respectively. As shown in Fig. for high values of mA only a small fraction of signal events would have 
the A decay products contained within the CAS jet. The small signal acceptance comes with the benefit of improved 
background rejection and the ability to reconstruct the A boson mass, desirable in such simple analysis. However, it 
is expected that a dedicated multivariate analysis focused on the sample rejected by this analysis, similar in spirit to 
the ATLAS and CMS searches for the SM Higgs boson in tth, h ^ bb [33[S2], could also achieve significant signal 
sensitivity at high uia- Evaluating this possibility is beyond the scope of this study. The number of 6-tags inside 
the C/A jet is determined by matching the 6-tagged AKT2 jets within a cone of radius AR = 0.75R^^^. Finally, 
a requirement is made is that the C/A jets have > 2 6-tags inside. In case of more than one selected C/A jet, the 
leading px one is chosen. 

Table [h| presents the expected yields for signal and the SM backgrounds per fb“^ of integrated luminosity as a 
function of the selection cuts applied in each of the analysis channels under consideration; (>5j, 3b) and (>5j, >4b). 
In the case of the (>5j, 3b) channel, the dominant background after final selection is tt-|-light-jets, where typically the 
two 6-quarks from the top quark decays, as well as the c-quark from the TV —>■ cs decay, are 6-tagged. In contrast, in 
the (>5 j, >4 b) channel half of the background is tt-|->16, with ti+bb being its leading contribution. The rest of the 
background is approximately equally split between tt+>lc and tt-|-light-jets. In this table the expected contribution 
from ttA signal is obtained under the assumptions of = 2 and B{A —>■ 66) = 1. Both analysis channels have 
approximately the same amount of signal, while the background is about a factor of four higher in the (>5j, 3b) 
channel than in the (>5j, >4b) channel. Together with the different composition of the background, the very different 
signal-to-background ratio between both channels is the primary motivation for analysing them separately. 

The final discriminating variable is the invariant mass of the selected C/A jet, referred to as “leading BDRS jet 
mass”. Figures]^ and 1^ show the expected distribution of the BDRS jet mass for signal and background in each of 
the analysis channels, for the different toa values considered. The distributions correspond to ^/s = 14 TeV and are 
normalised to an integrated luminosity of 30 fb“^. For the assumed values of pt = 2 and B{A —>■ 66) = 1, the signal 
is clearly visible on top of the background. 


B. Systematic uncertainties 

Several sources of systematic uncertainty are considered that can affect the normalisation of signal and background 
and/or the shape of the BDRS jet mass distribution. Individual sources of systematic uncertainty are considered 

























6 



tt-|->16 

tt+>lc 

tt-l-light-jets 

tt + X 

Total bkg. 

ttA 

rriA = 30 GeV 

1 lepton 

4107 

10958 

155048 

299 

171072 

377 

>5 jets 

3109 

7078 

01800 

215 

72808 

208 

3 6-tags 

700 

705 

2702 

30.1 

4203 

72.4 

>1 CA6 jets 

510 

502 

1485 

21.4 

2518 

55.7 

>2 6-tags in selected GAO jet 

45.1 

38.4 

159 

1.9 

245 

14.6 

>4 6-tags 

234 

100 

128 

10.0 

474 

28.7 

>1 CA6 jets 

171 

70.1 

75.7 

7.9 

325 

23.8 

>2 6-tags in selected GAO jet 

30.9 

13.2 

18.5 

1.5 

70.2 

11.7 

niA = 80 GeV 

1 lepton 

4107 

10958 

155048 

299 

171072 

240 

>5 jets 

3109 

7078 

01800 

215 

72808 

198 

3 6-tags 

700 

705 

2702 

30.1 

4203 

57.5 

>1 CAS jets 

252 

240 

040 

11.5 

1155 

23.0 

>2 6-tags in selected CAS jet 

32.3 

32.8 

125 

2.0 

192 

6.1 

>4 6-tags 

234 

100 

128 

10.0 

474 

25.0 

>1 CAS jets 

91.0 

30.4 

35.0 

4.3 

107 

11.0 

>2 6-tags in selected CAS jet 

25.8 

10.0 

12.0 

1.5 

50.4 

5.3 


TABLE II: Expected signal and SM backgrounds at ^/s = 14 TeV per fb“^ of integrated luminosity as a function of the selection 
cuts applied in each of the analysis channels under consideration (see text for details): (>5j, 3b) and (>5j, >4b). The signal 
prediction is obtained under the assumptions of gt = 2 and B{A ^ bb) = 1. Several background categories have been merged 
for readability. The sum of tt+W, tt+Z and tt+hsM is denoted as tt+X. The yields shown correspond to the optimised 
selections for two different values of tua, 30 GeV and 80 GeV. Shown in bold are the signal and backgrounds expectations after 
full selection in each of the analysis channels considered. 


uncorrelated. Correlations of a given systematic uncertainty are maintained across processes and analysis channels. 
The choices of what uncertainties to consider and their magnitude are inspired by recent tt+hsM, hsM bb searches 
at the LHC [45] . 

A 15% normalisation uncertainty is assigned to tt+light-jets corresponding to the modelling of the jet multiplicity 
spectrum. A 30% normalisation uncertainty is assigned to each of the tt+HF components {tt+b, ti+bb, ti+B, tt+c, 
tt+cc, tt+C), and taken to be uncorrelated among them. These uncertainties are expected to be conservative given 
the recent progress in NLO predictions for tt with up to two jets merged with a parton shower [64], as well as NLO 
predictions for tt+> lb production in the 4F scheme matched to a parton shower [65] . Cross section uncertainties 
for tt+W, tt+Z and tt+hsM are taken to be 30% for each process. Uncertainties associated to jet energy and mass 
calibration are taken to be 5% per jet, fully correlated between energy and mass and across all jets in the event. 
Finally, uncertainties on the b-, c- and light-jet tagging efficiencies are taken to be 3%, 6% and 15% respectively. 
These uncertainties are taken as uncorrelated between 6-jets, c-jets, and light-jets. As shown in Figs, [^andj^ the 
resulting total background normalisation uncertainty is about 20%, although the different uncertainty components 
have different shape in the final distribution. 


C. Statistical analysis 

The BDRS jet mass distribution in the two analysis channels under consideration (see Figs. and are tested for 
the presence of a signal. To obtain the most realistic possible sensitivity projection, a sophisticated statistical analysis 
is performed, following very closely the strategy adopted in the experimental analyses at the LHC. 

For each itia hypothesis, 95% CL upper limits on the tiA production cross section times branching ratio, a{ttA) x 
B{A ^ bb), are obtained with the CLg method [Ml EZ] using a profile likelihood ratio as test statistic implemented 
in the RooFit package [Ml ESj- The likelihood function C { fi , 9 ) depends on the signal-strength parameter /r, a 
multiplicative factor to the theoretical signal production cross section, and 6, a set of nuisance parameters that 
encode the effect of systematic uncertainties in the analysis. The likelihood function is constructed as a product of 
Poisson probability terms over all bins of the distributions analysed, and of Gaussian or log-normal probability terms. 
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niA = 20 GeV 


rriA = 30 GeV 


niA = 40 GeV 








FIG. 3: Distribution of the leading BDRS jet mass in the two analysis channels considered after final selection: (top) (>5j, 3b) 
and (bottom) (>5j, >4b), for different values of rriA (20, 30 and 40 GeV). The prediction corresponds to ^/s = 14 TeV and 
an integrated luminosity of 30 fb“^. Several background categories have been merged for visibility. The expected contribution 
from the tiA signal under the assumptions of f/t = 2 and B{A ^ bb) = 1 is also shown (red histogram), stacked on top of the SM 
background. The dashed red line shows the UA signal distribution normalised to the background yield to better compare the 
shape to that of the background. The bottom panel displays the expected total systematic uncertainty on the total prediction 
prior to the fit to the pseudo-data. 


each corresponding to a nuisance parameter. For a given assumed value of the profile likelihood ratio is defined 
as: 


q^ = (3) 

where are the values of the nuisance parameters that maximise the likelihood function for a given value of and 
/t and 9 are the values of the parameters that maximise the likelihood function (with the constraint 0 < /i < /r). 
The maximisation of the likelihood function over the nuisance parameters allows variations of the expectations for 
signal and background in order to improve the agreement with (pseudo-)data, yielding a background prediction with 
reduced overall uncertainty and thus resulting in an improved sensitivity. For a given niA hypothesis, values of 
the production cross section (parameterised by p.) yielding CLs<0.05, where CLg is computed using the asymptotic 
approximation [70], are excluded at >95% CL. 



















































niA = 60 GeV 


rriA = 80 GeV 


niA = 100 GeV 








FIG. 4: Distribution of the leading BDRS jet mass in the two analysis channels considered after final selection: (top) (>5j, 3b) 
and (bottom) (>5j, >4b), for different values of niA (60, 80 and 100 GeV). The prediction corresponds to ^/s = 14 TeV and 
an integrated luminosity of 30 fb“^. Several background categories have been merged for visibility. The expected contribution 
from the tiA signal under the assumptions of f/t = 2 and B{A ^ bb) = 1 is also shown (red histogram), stacked on top of the SM 
background. The dashed red line shows the tiA signal distribution normalised to the background yield to better compare the 
shape to that of the background. The bottom panel displays the expected total systematic uncertainty on the total prediction 
prior to the fit to the pseudo-data. 


V. ESTIMATED LIMITS ON A LIGHT CP-ODD SCALAR 


Following the analyses steps and the limit setting outlined in Sects. ra we estimate expected 95% CL upper 
limits on the production cross section times branching ratio, a{tiA) x B{A bb), as a function of rriA (see Fig. 

->■ bb) as a function of toa for different values of 
the upper limits on a{tiA) x B{A —)■ bb) can be 


Table 


III 


summarises the 95% CL upper limits on a{ttA) x B{A 
the integrated luminosity. Under the assumption B{A —)■ bb) = 1, 
translated on upper limits on \gt\, which are summarised in Table IV 

Using the reconstruction strategy outlined in Sec. IV A[ a CP-odd scalar that couples with gt = 1 can be excluded 
for 20 < toa < 90 GeV with only 30 fb“^ of data (see Fig. With an increased statistics of 300 fb“^ couplings as 


low as gt — 0.5 can be constrained over a large mass range, i.e. 30 < ruA < 80 GeV. 


VI. INTERPRETATION OF LIMITS 

A light CP-odd Higgs boson (ttia < 125 GeV), which may or may not be related to global symmetries being present, 
exists in many extensions of the SM. Its couplings with gauge bosons are generically suppressed, yielding weak bounds 
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Vs 


' I ' ' I ' ' ' ' I ' ' ' I I ' I I I I ' I ' ' I ' ' ' ' I ' ' ' ' 

. 95% CL Expected limit 

95% CL Expected limit ± 1 o 
:14TeV 30 fb ^ 95% CL Expected limit ± 2 o 

. 9 . = 2 

= ^:a5 




20 30 40 50 60 70 80 90 100 

[GeV] 


(a) 


(b) 


FIG. 5: Expected 95% CL upper limits on o{ttA) x B{A —>■ bb) as a function of rriA in pp collisions at ^ = 14 TeV for an 
integrated luminosity of (a) 30 fb“^ and (b) 300 fb“^. The green and yellow bands correspond to 1 and 2 standard deviations 
respectively around the median expected limit under the backgronnd-only hypothesis. Also shown are the theoretical cross 
sections for a{tiA) for different assumed values of gt (0.5, 1.0 and 2.0) and B{A —>■ bb) = 1. 



95% CL upper 

limits on o{ttA) x 

B{A^ 

bb) (pb) 




niA 

(GeV) 



C (fb-1) 

20 

30 

40 

60 

80 

100 

1 

4.46 

2.50 

2.38 

2.57 

2.78 

3.94 

30 

1.02 

0.48 

0.45 

0.51 

0.53 

0.78 

100 

0.67 

0.29 

0.25 

0.29 

0.30 

0.46 

300 

0.46 

0.18 

0.16 

0.18 

0.18 

0.30 

3000 

0.17 

0.066 

0.057 

0.065 

0.065 

0.13 


TABLE III: Expected 95% CL upper limits on a{ttA) x B{A —>■ bb) as a function of niA in pp collisions at y/s = 14 TeV for 
different integrated Inminosities. 




95% 

CL upper limits on 

l5t| 





rriA 

1 

0| 



C (fb-i) 

20 

30 

40 

60 

80 

100 

1 

2.73 

2.14 

2.18 

2.48 

2.82 

3.65 

30 

1.31 

0.94 

0.95 

1.10 

1.23 

1.62 

100 

1.06 

0.72 

0.71 

0.83 

0.93 

1.25 

300 

0.88 

0.57 

0.55 

0.65 

0.72 

1.00 

3000 

0.54 

0.35 

0.34 

0.39 

0.43 

0.67 


TABLE IV: Expected 95% CL upper limits on |grt| as a function of ruA in pp collisions at ^/s = 14 TeV for different integrated 
luminosities, under the assumption B{A —>■ bb) = 1. 


from LEP. If it may be searched via the decay hsu —>■ AA. Though such decay sometimes has a large 

branching ratio, being in conflict with current Higgs precision data, there do exist scenarios, in both supersymmetric 
and non-supersymmetric theories, where the H(/ism —f AA) is suppressed. Therefore, new strategies for collider 
searches that could cover as large as possible model parameter space with a light CP-odd Higgs boson, are necessary. 
Next, we will interpret our collider analysis of tiA in several representative beyond-SM scenarios. 
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A. 2HDM 


In the MSSM, a supersymmetric extension of a type-II 2HDM, a scenario with a light CP-odd Higgs boson is hard 
to achieve, given constraints from precision Higgs data. This is not surprising since there are only two free parameters 
at tree level in the Higgs sector, due to supersymmetric interrelations. The picture, however, is changed in the 2HDM 
without supersymmetry. With a softly-broken symmetry (<i>i —> $i, $2 —^> 2 ), which is often introduced to 

suppress scalar-mediated flavor changing processes, the Higgs potential of the 2HDM is given by: 

H($i,$2) = + (771^2^1^2 +h.C.) -b -b ^A2($^$2)^ 

-bA3($l$l)($^«>2) + A4($I«>2)($2 ®i) + ^A5[($I$2)^ + h.C.]. (4) 

Here $12 are complex SU{2)l doublets. Assuming no CP-violation, the model has two CP-even and one CP-odd 
spin-0 neutral eigenstates, denoted as h, H, and A, respectively. Such a setup contains seven free parameters at 
tree level (including all Higgs masses), yielding a large parameter space that can accommodate a light CP-odd Higgs 
boson. 

Theoretically, the 125 GeV SM-like Higgs boson hsu could be either the light CP-even Higgs boson (h) or the heavy 
one {H). If ruA < mhsM/‘^^ the decay hsM — t AA is kinematically allowed. Often the partial width for hsu —t AA 
becomes comparable or ever dominant over that of hsu —t bb, given that the latter is suppressed by the lightness of 
the bottom quark. Therefore, hsM —t AA decays become a good probe for these light bosonic particles. However, as 
discussed recently in the alignment limit [cos(/3 — a) = 0 if hsM = h, and sin(/3 — a) = 0 if /ism = H], which is 
favoured by current precision Higgs measurements, the Higgs coupling g^^-^AA is reduced to: 


|5/ism71j4 I 


2m\ -b - dmfa/ sin 2/3 

V 


(5) 


In case that 2 to^ -b ~ 4777 ^ 2 /sin 2/3, the decay /ism —>■ AA would be greatly suppressed. Therefore, collider 

strategies are needed to probe these scenarios with rriA < 'itism/2, as well as the scenarios with ttia > wsm/S. 

We should note that the perturbation requirement for Higgs couplings yields bounds on tan /3. Particularly, the 
coupling Ai is related to the Higgs boson mass via the relation m 


Ai — 


tan^ /3 — 777^2 (tan /3 -b tan^ /3) 


Assuming qu^-^^aa = 0, it becomes: 


Ai — 


777^ -b tan^ I3{m]j - m\ 12 - m\) 


( 6 ) 

(7) 


Given — m\^_^l2 — m\ > 0 for ruA < w;jg„/2, the perturbativity condition Ai < 47r immediately sets an upper 
bound for tan j3 in this region: 


tan/3 < 


47n;2 — m\ 




/ 2 -^ 


< 


47rv^ 


‘‘hSM 


/2-: 


10 - 20 . 


( 8 ) 


These features are illustrated in Fig. Additionally, the perturbation requirement for top Yukawa couplings can 
bound the tan/3 value from below. So we will limit our discussions for tan/3 > 0.1. 

The expected sensitivities for probing these scenarios in the 2HDM via bbA and tlA production are presented in 
Fig. 0 For illustration, we focus on type-I and type-H 2HDMs. The bbA reach is estimated based on the projections 
from Ref. [73], neglecting systematic uncertainties. Within a type-H 2HDM, the ttA and bbA channels are comple¬ 
mentary to each other in searching for light CP-odd Higgs bosons, since the coupling gbbA is tan /3-enhanced whereas 
gttA is cot ,d-enhanced. With integrated luminosities in excess of 300 fb“^, the whole parameter region can be covered 
except a corner with relatively large mA and moderate tan/3. This is interesting given that low tan/3 is particularly 
favoured by perturbativity. In contrast, within a type-I 2HDM, the coupling gi,i,A would also be cot/3-enhanced, so 


^ See Refs. 11211721 for discussions in the context of the NMSSM. 
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FIG. 6: Parameter region with B{hsM AA) < 30% in the 2HDM (blank region). The blank belts in the region with 
ruA < mhg^/2, which are characterised by different boundary colours, are yielded by different mi 2 values. The black dashed 
line represents a universal upper limit on tan/3 due to the perturbation requirement for Ai. 


20 40 60 80 

mA(GeV) 

(a) 



50.0 



40 60 80 

myi(GeV) 

(b) 


FIG. 7: Sensitivity reach (at 95% CL) of the bbA and ttA channels within the (a) type-I 2HDM and (b) type-II 2HDM. The 
green bands represent a region where the recently observed gamma-ray excess from the Galactic Centre can be explained, 
yielding a DM annihilation cross section of {av) ~ 1 — 2.5 x 10~^®cm^s“^. Here the DM particle mass = 50 GeV and the 
coupling between the mediator and the DM particles = 0.3 are assumed. 


both search channels are no longer probing complementary tan/3 regions. As a matter of fact, in such scenario the 
ttA channel provides a better sensitivity to search for the light CP-odd Higgs boson over the whole mass range of 
20 GeV < ruA < 100 GeV, although the high-tan /3 region remains difficult to probe. 

Searches for tiA and bbA also provide a probe for DM physics. For example, consider a Dirac fermion y that is a 
DM candidate, with mass m^, and coupling to the CP-odd scalar A via: 

£ D VxAxi'fx- (9) 

Integrating out A yields a dimension-six effective operator: 


( 10 ) 
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Such an operator implies s-wave DM annihilation xx with 


{av) 


3 vlglvlml /tT^ 

Stt {m\ — y ’ 


( 11 ) 


allowing an explanation for the recently observed diffuse gamma-ray excess from the Galactic Centre mill], and a 
spin-dependent and p-wave-suppressed direct detection signal, resulting in a weak bound from current direct detection 
searches. In Fig.[^ the tan P-niA values consistent with an explanation of the gamma-ray excess are indicated, yielding 
a DM annhilation cross section of {av) ~ 1 — 2.5 x 10“^®cm^s“^, with = 50 GeV m and = 0.3 assumed. In 
this scenario, monojet searches at the LHC would also be insensitive since the decay A —)■ xX would be kinematically 
forbidden, while the ttA, A ^ bb search would provide an effective probe. 


B. NMSSM 


Another class of benchmark scenairos for light CP-odd Higgs bosons arise in the NMSSM, with the superpotential 
and soft supersymmetry-breaking terms of its Higgs sector given by 


W = ASHuHd -f 

o 

Vsoft = + rriglSl^ — (XAxHuHdS + h.c.)-I-( —-|- h.c.), 


( 12 ) 


where Hj,, iL„ and S denote the neutral Higgs helds of the Hd, Hu and S supermultiplets, respectively. For conve¬ 
nience, let’s define its CP-even and CP-odd mass eigenstates as Hi, i = 1,2, 3, and Aj, j — 1, 2, respectively. 

In contrast with the 2HDM case, the light CP-odd Higgs boson in the NMSSM often results from breaking an 
approximate global symmetry spontaneously, serving as an axion or a pseudo-Goldstone boson. Its appearance is 
thus less “artificial”. Let us start with the tree-level mass matrix of the CP-odd Higgs bosons in the NMSSM: 


M% = 


which yields a determinant 


m. 


A^U^S2/3 


Au(^sin2/3-5^) 
sin 2/3+If) - 


3KA„fj. 

X 


m\ = 


2fi{A\ + Ks) 
sin 2/3 


det(AIp) = QkXv'^^Ax — 


A sin 2^ 




(13) 


(14) 


Necessarily, the scenarios with a light Ai {Ai denotes the lightest CP-odd Higgs boson) or —>■ 0 yield det(AIp) —>■ 0 
and viceversa, if such a stable vacuum exists. Among various possibilities, two have been studied extensively: R- 
symmetry (or R-limit) and Peccei-Quinn (PQ) symmetry (or PQ-limit), both of which yield a vanishing determinant 
at tree level. Another difference between these two class of scenarios is that the light CP-odd Higgs boson in the 
NMSSM is typically singlet-like. This can be understood since the Goldstone boson of a spontaneously-broken global 
17(1) symmetry is manifested as 


E 


qiVj 

VU{1) 


= S,Hu,Hd. 


(15) 


Here Up(i) = \/X) is the 17(1) breaking scale and qt is the 17(1) charge of An effective parameter ^ = X{vs) of 
the electroweak scale with A ^ 0(0.1) naturally yields vs S> Vu,Vd, and hence a singlet-like pseudo-Goldstone boson. 
This feature renders such a light boson much more difficult to probe at colliders, compared to the 2HDM case. Next, 
we will evaluate the collider constraints on these two scenarios. 


1. R-limit: A\ —)■ 0, A„ —)■ 0, where the theory is approximately invariant under the transformation 

Hu Hu exp(l(/)R), Hd -A Hd exp(i0R), S ^ S exp(#R,), 
and the tree-level couplings of the R-axion Ai with the top and bottom quarks are given by 


VAitt — 


2Xv cos^ /3 


VAibb = 


2Xv sim P 


(16) 


(17) 
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In this scenario, both A and k can be large, yielding a sizeable contribution to the mass of the SM-like Higgs 
boson at tree level. Hence, a large value for tan /3 is unnecessary. A scan in the parameter space in this scenario 
is performed using NMSSMTools 4.2.1 m including all built-in constraints, such as from Higgs searches, 
superpartner searches, muon g — 2, flavour physics, invisible Z-decay, and the constraints from T decays (with 
the exception of the Landau pole test and DM related-constraints, which are not considered). The resulting 
values for the i/Aitt and yAibb couplings are compared to the expected collider bounds in Fig. |^a). Depending 
on the parameter values, the magnitude of yAitt in this scenario can be up to ~ 0.5. Only for an integrated 
luminosity of 3000 fb“^ the LHC can probe a coupling yAitt as small as 0.5 via the ttAi, Ai -A bb channel. 
Therefore this scenario is difficult to probe, even at the HL-LHC. 



FIG. 8: Sensitivity reach (at 95% CL) to the R-limit scenario in the NMSSM via the (a) ttAi and (b) bbAi channels. The scan 
is over all parameters, in the ranges 0.1 < A < 0.6, 0.1 < k < 0.6, —7 < Ax < 7 GeV, —7 < < 0 GeV, 0.1 < tan/3 < 5, and 

100 < fj, < 500 GeV. We have assumed soft squark masses of 2 TeV, slepton masses of 200 GeV, Au,d,e = —3.5 TeV, and bino, 
wino and gluino masses of 100, 200, and 2000 GeV, respectively. The hue of the scatter points represents the correspondent 
tan/3 values. 


2. PQ-limit: ^ 0, -A 0, where the theory is approximately invariant under the transformation 

exp(/(/)pQ), Hd ^ Hdexp{i(j)pQ), S ^ Sexp{-2i4>pQ), 

and the tree-level mass of the PQ pseudo-Goldstone boson Ai are given given by 

SkAk^ 
mA^ = -r—. 


(18) 


(19) 


This scenario has been proposed as a supersymmetric benchmark for sub-electroweak scale (singlino-like) 
DM [75], since its lightest neutralino is generically singlino-like and lighter than the electroweak scale. Par¬ 
ticularly, in this scenario Ai can serve as the mediator for DM annihilation into a bottom quark pair and 
explain the diffuse gamma-ray excess from the Galactic Centre mm- In this limit, the tree-level couplings of 
Ai with the top and bottom quarks are given by 


Xv cos^ P Xv sin^ /3 

yAitt — : yAibb — j (^9) 

y M 

and so smaller by a factor of two than the corresponding couplings in the R-limit. Furthermore, a smaller A is 
favoured in this limit and a relatively large tan /3 is needed to generate a mass of 125 GeV for the SM-like Higgs 
boson. Therefore, the coupling yAitt tends to be smaller than in the R-limit scenario. The resulting values for 
the yAitt and yAibb couplings are compared to the expected collider bounds in Fig.|^b). For most of the points, 
the magnitude of yAitt is below 0.1, which renders this scenario extremely difficult to probe at the using the 
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ttAi channel.^ 



OTAi(GeV) mAi(GeV) 

(a) (b) 


FIG. 9: Sensitivity reach (at 95% CL) to the PQ-limit scenario in the NMSSM via the (a) ttAi and (b) bbAi channels. The 
scan is over all parameters, in the ranges 0.06 < A < 0.6, 5 < k/X < 100, |£'| = \Ax/ fJ-tan P — 1| < 0.25, —100 < Ak < 0 GeV, 
0.1 < tan/3 < 5, and 100 < < 500 GeV. We have assumed soft squark masses of 2 TeV, slepton masses of 200 GeV, 
Au,d,e = 3.5 TeV, and bino, wino and gluino masses of 100, 200, and 2000 GeV, respectively. The hue of the scatter points 
represents the correspondent tan/3 values. 


Finally, we stress that the bbAi channel doesn’t help much in probing the R- and PQ-limit scenarios. The sensitivities 
of both searches are suppressed by the mixture with the singlet. Even worse, the mixing is approximately tan/3 
enhanced, further suppressing the sensitivity of the bbAi in probing the large tan/3 region in both scenarios. 


VII. CONCLUSIONS 

Searches for CP-odd scalars, as predicted by many extensions of the Standard Model and motivated by some recent 
astroparticle observations, are part of the core program of upcoming LHC runs at y/s = 13 and 14 TeV. Searches at 
LEP and during Run 1 of the LHC at ^/s = 7 and 8 TeV have placed only weak constraints on the coupling strengths 
of CP-odd scalars with top and bottom quarks, or in their allowed mass range. 

Using a simplified model approach for the signal, we have carried out a detailed study to evaluate the prospects at 
the LHC for probing scenarios with a CP-odd scalar with mass 20 < niA < 100 GeV, via the process pp —>■ UA with 
subsequent decay A bb. To separate the signal from the large background from tt-|-jets production, we apply jet 
substructure techniques, reconstructing the mass of the CP-odd scalar as the mass of a large-radius jet containing two 
6 -tagged subjets. The chosen method allows for a so-called ’bump hunt’ over a fairly smooth background, and it may 
be the most promising strategy for searching for a CP-odd scalar with mass ^ 50 GeV, i.e. about twice the typical 
minimum px cut for narrow jets used in standard LHC searches. A significant effort has been made in developing a 
semi-realistic experimental analysis, including a fairly complete description of systematic uncertainties and the usage 
of sophisticated statistical tools to constrain in-situ the effect of systematic uncertainties, thus limiting their impact 
on the search sensitivity. We then derive expected upper limits on the production cross section times branching ratio 
using the CLg method. 

In specific models, e.g. 2HDM or NMSSM, the coupling of the A boson with the top quark is related to other 
couplings in a well-defined way. Hence, the upper limits obtained on this coupling for a given mass uia, can be used 
to bound other couplings of these models indirectly or as input for a global coupling ht. We find that in a type-I 
and type-H 2HDM the LHG can constrain a large fraction of the (m^,tan/3) parameter space, including the region 
preferred to explain the diffuse gamma-ray excess from the Galactic Gentre as dark-matter annihilation via a CP-odd 


® For alternative way to probing this scenario, using exotic Higgs decays, see e.g. Ref. 11211771 l78l . 
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scalar mediator and decaying into bb. However, in the case of the NMSSM with a light CP-odd scalar, a Goldstone 
boson of either a spontaneously-broken R- or PQ-symmetry, the LHC appears to have very limited sensitivity in 
probing these models. 

Hence, depending on the concrete embedding of the scalar sector into a UV-complete theory, the LHC can provide 
complementary information, not accessible at either indirect detection experiments or electron-positron colliders, on 
the existence of CP-odd scalars, their mass and couplings to third-generation fermions. 
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